Serveur d'exploration sur la recherche en informatique en Lorraine

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Model-based classification with dissimilarities: a maximum likelihood approach

Identifieur interne : 004358 ( Main/Exploration ); précédent : 004357; suivant : 004359

Model-based classification with dissimilarities: a maximum likelihood approach

Auteurs : Eugène-Patrice Ndong Nguéma [Cameroun] ; Guillaume Saint-Pierre [France]

Source :

RBID : ISTEX:D91E26C283F26AC855AF776795F9D320D1DFEA03

English descriptors

Abstract

Abstract: Most of classification problems concern applications with objects lying in an Euclidean space, but, in some situations, only dissimilarities between objects are known. We are concerned with supervised classification analysis from an observed dissimilarity table, which task is classifying new unobserved or implicit objects (only known through their dissimilarity measures with previously classified ones forming the training data set) into predefined classes. This work concentrates on developing model-based classifiers for dissimilarities which take into account the measurement error w.r.t. Euclidean distance. Basically, it is assumed that the unobserved objects are unknown parameters to estimate in an Euclidean space, and the observed dissimilarity table is a random perturbation of their Euclidean distances of gaussian type. Allowing the distribution of these perturbations to vary across pairs of classes in the population leads to more flexible classification methods than usual algorithms. Model parameters are estimated from the training data set via the maximum likelihood (ML) method, and allocation is done by assigning a new implicit object to the group in the population and positioning in the Euclidean space maximizing the conditional group likelihood with the estimated parameters. This point of view can be expected to be useful in classifying dissimilarity tables that are no longer Euclidean due to measurement error or instabilities of various types. Two possible structures are postulated for the error, resulting in two different model-based classifiers. First results on real or simulated data sets show interesting behavior of the two proposed algorithms, ant the respective effects of the dissimilarity type and of the data intrinsic dimension are investigated. For these latter two aspects, one of the constructed classifiers appears to be very promising. Interestingly, the data intrinsic dimension seems to have a much less adverse effect on our classifiers than initially feared, at least for small to moderate dimensions.

Url:
DOI: 10.1007/s10044-008-0105-2


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Model-based classification with dissimilarities: a maximum likelihood approach</title>
<author>
<name sortKey="Ndong Nguema, Eugene Patrice" sort="Ndong Nguema, Eugene Patrice" uniqKey="Ndong Nguema E" first="Eugène-Patrice" last="Ndong Nguéma">Eugène-Patrice Ndong Nguéma</name>
</author>
<author>
<name sortKey="Saint Pierre, Guillaume" sort="Saint Pierre, Guillaume" uniqKey="Saint Pierre G" first="Guillaume" last="Saint-Pierre">Guillaume Saint-Pierre</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:D91E26C283F26AC855AF776795F9D320D1DFEA03</idno>
<date when="2008" year="2008">2008</date>
<idno type="doi">10.1007/s10044-008-0105-2</idno>
<idno type="url">https://api.istex.fr/ark:/67375/VQC-9ZHB0RB8-V/fulltext.pdf</idno>
<idno type="wicri:Area/Istex/Corpus">003377</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Corpus" wicri:corpus="ISTEX">003377</idno>
<idno type="wicri:Area/Istex/Curation">003335</idno>
<idno type="wicri:Area/Istex/Checkpoint">000D86</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Checkpoint">000D86</idno>
<idno type="wicri:doubleKey">1433-7541:2008:Ndong Nguema E:model:based:classification</idno>
<idno type="wicri:Area/Main/Merge">004469</idno>
<idno type="wicri:Area/Main/Curation">004358</idno>
<idno type="wicri:Area/Main/Exploration">004358</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">Model-based classification with dissimilarities: a maximum likelihood approach</title>
<author>
<name sortKey="Ndong Nguema, Eugene Patrice" sort="Ndong Nguema, Eugene Patrice" uniqKey="Ndong Nguema E" first="Eugène-Patrice" last="Ndong Nguéma">Eugène-Patrice Ndong Nguéma</name>
<affiliation wicri:level="1">
<country xml:lang="fr">Cameroun</country>
<wicri:regionArea>Laboratoire de Mathématiques et Analyse des Systèmes, Ecole Polytechnique, P.O. Box 8390, Yaoundé</wicri:regionArea>
<wicri:noRegion>Yaoundé</wicri:noRegion>
</affiliation>
<affiliation></affiliation>
</author>
<author>
<name sortKey="Saint Pierre, Guillaume" sort="Saint Pierre, Guillaume" uniqKey="Saint Pierre G" first="Guillaume" last="Saint-Pierre">Guillaume Saint-Pierre</name>
<affiliation wicri:level="3">
<country xml:lang="fr">France</country>
<wicri:regionArea>LIVIC, Laboratoire sur les Interactions Véhicules-Infrastructure-Conducteurs, Unité mixte INRETS-LCPC, Bâtiment 824, 14, route de la Minière, Satory, 78000, Versailles</wicri:regionArea>
<placeName>
<region type="region" nuts="2">Île-de-France</region>
<settlement type="city">Versailles</settlement>
</placeName>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">France</country>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="j">Pattern Analysis and Applications</title>
<title level="j" type="abbrev">Pattern Anal Applic</title>
<idno type="ISSN">1433-7541</idno>
<idno type="eISSN">1433-755X</idno>
<imprint>
<publisher>Springer-Verlag</publisher>
<pubPlace>London</pubPlace>
<date type="published" when="2008-09-01">2008-09-01</date>
<biblScope unit="volume">11</biblScope>
<biblScope unit="issue">3-4</biblScope>
<biblScope unit="page" from="281">281</biblScope>
<biblScope unit="page" to="298">298</biblScope>
</imprint>
<idno type="ISSN">1433-7541</idno>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">1433-7541</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Dissimilarity data</term>
<term>Intrinsic data dimension</term>
<term>Maximum likelihood estimate</term>
<term>Model-based classifier</term>
<term>Multidimensional scaling</term>
<term>Success classification rate</term>
</keywords>
</textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Abstract: Most of classification problems concern applications with objects lying in an Euclidean space, but, in some situations, only dissimilarities between objects are known. We are concerned with supervised classification analysis from an observed dissimilarity table, which task is classifying new unobserved or implicit objects (only known through their dissimilarity measures with previously classified ones forming the training data set) into predefined classes. This work concentrates on developing model-based classifiers for dissimilarities which take into account the measurement error w.r.t. Euclidean distance. Basically, it is assumed that the unobserved objects are unknown parameters to estimate in an Euclidean space, and the observed dissimilarity table is a random perturbation of their Euclidean distances of gaussian type. Allowing the distribution of these perturbations to vary across pairs of classes in the population leads to more flexible classification methods than usual algorithms. Model parameters are estimated from the training data set via the maximum likelihood (ML) method, and allocation is done by assigning a new implicit object to the group in the population and positioning in the Euclidean space maximizing the conditional group likelihood with the estimated parameters. This point of view can be expected to be useful in classifying dissimilarity tables that are no longer Euclidean due to measurement error or instabilities of various types. Two possible structures are postulated for the error, resulting in two different model-based classifiers. First results on real or simulated data sets show interesting behavior of the two proposed algorithms, ant the respective effects of the dissimilarity type and of the data intrinsic dimension are investigated. For these latter two aspects, one of the constructed classifiers appears to be very promising. Interestingly, the data intrinsic dimension seems to have a much less adverse effect on our classifiers than initially feared, at least for small to moderate dimensions.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>Cameroun</li>
<li>France</li>
</country>
<region>
<li>Île-de-France</li>
</region>
<settlement>
<li>Versailles</li>
</settlement>
</list>
<tree>
<country name="Cameroun">
<noRegion>
<name sortKey="Ndong Nguema, Eugene Patrice" sort="Ndong Nguema, Eugene Patrice" uniqKey="Ndong Nguema E" first="Eugène-Patrice" last="Ndong Nguéma">Eugène-Patrice Ndong Nguéma</name>
</noRegion>
</country>
<country name="France">
<region name="Île-de-France">
<name sortKey="Saint Pierre, Guillaume" sort="Saint Pierre, Guillaume" uniqKey="Saint Pierre G" first="Guillaume" last="Saint-Pierre">Guillaume Saint-Pierre</name>
</region>
<name sortKey="Saint Pierre, Guillaume" sort="Saint Pierre, Guillaume" uniqKey="Saint Pierre G" first="Guillaume" last="Saint-Pierre">Guillaume Saint-Pierre</name>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Lorraine/explor/InforLorV4/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 004358 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 004358 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Wicri/Lorraine
   |area=    InforLorV4
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     ISTEX:D91E26C283F26AC855AF776795F9D320D1DFEA03
   |texte=   Model-based classification with dissimilarities: a maximum likelihood approach
}}

Wicri

This area was generated with Dilib version V0.6.33.
Data generation: Mon Jun 10 21:56:28 2019. Site generation: Fri Feb 25 15:29:27 2022